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Abstract 

Background: Shigella flexneri is the major cause of bacillary dysentery in tlie developing countries. Tlie 
lipopolysaccliaride (LPS) 0-antigen of 5. flexneri plays an important role in its pathogenesis and also divides 
5. flexneri into 19 serotypes. All the serotypes with an exception for serotype 6 share a common 0-antigen backbone 
comprising of N-acetylglucosamine and three rhamnose residues. Different serotypes result from modification of the 
basic backbone conferred by phage-encoded glucosyltransferase and/or acetyltransferase genes, or plasmid-encoded 
phosphoethanolamine transferase. Recently, a new site for 0-acetylation at positions 3 and 4 of Rha'", in serotypes la, 
1 b, 2a, 5a and Y was shown to be mediated by the oacB gene. Additionally, this gene was shown to be carried by a 
transposon-like structure inserted upstream of the adrA region on the chromosome. 

Results: In this study, a novel bacteriophage SflOl, encoding the oacB gene was isolated and characterised from a 
serotype 7a strain. The complete sequence of its 38,742 bp genome encoding 66 open reading frames (orfs) was 
determined. Comparative analysis revealed that phage SflOl has a mosaic genome, and most of its proteins were 
>90% identical to the proteins from 12 previously characterised lambdoid phages. In addition, the organisation of 
SflOl genes was found to be highly similar to bacteriophage Sf6. Analysis of the SflOl OacB identified two amino acid 
substitutions in the protein; however, results obtained by NMR spectroscopy confirmed that SflOl -OacB was functional. 
Inspection of the chromosomal integration site of SflOl phage revealed that this phage integrates in the sbcB locus, 
thus unveiling a new site for integration of serotype-converting phages of 5. flexneri, and determining an alternative 
location of oocB gene in the chromosome. Furthermore, this study identified oacB gene in several serotype 7a isolates 
from various regions providing evidence of 0-acetyl modification in serotype 7a. 

Conclusions: This is the first report on the isolation of bacteriophage SflOl which contains the 5. flexneri 0-antigen 
modification gene oacB. SflOl has a highly mosaic genome and was found to integrate in the sbcB locus. These 
findings contribute an advance in our current knowledge of serotype converting phages of S. flexneri. 
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Background 

The burden of bacterial dysentery due to shigellosis con- 
tinues to be a major concern affecting more than 165 
million people annually. S. flexneri is the primary cause 
of endemic shigellosis prevalent in the developing coun- 
tries, and is the most frequently isolated species world- 
wide [1]. 
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In S. flexnerU the O-antigen structure of the LPS, is 
one of the key virulence determinants required for its 
pathogenesis and protection against the host defence [2] . 
The O-antigen of S, flexneri, with the exception of sero- 
type 6, consists of a polysaccharide backbone comprised 
of the repeating tetrasaccharide units (N-acetylglucosa- 
mine and three rhamnose residues), and modifications 
to the basic O-antigen backbone by the addition of glu- 
cosyl, O-acetyl, or phosphoethanolamine groups to one 
or more sugars, give rise to various different serotypes of 
S, flexneri [3-6]. To date at least 19 serotypes of S, flex- 
neri have been identified [7]. 
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The genes responsible for glucosylation of S. flexneri 
O-antigen, gtrA, gtrB and ^^r(type), are arranged in a 
single operon known as the gtr cluster [8-13]. Whereas, 
the acetylation is mediated by a single gene 'oac' encoding 
O-acetyltransferase [14]. While the glucosylation is known 
to occur on any of the sugar residues, oac-Qncoded 
O-acetylation occurs at position 2 of Rha^ residue. Ad- 
ditionally, the glucosyl or the O-acetyl modification genes 
are carried by bacteriophages, which mediate serotype- 
conversion by integrating into the host chromosome [3]. 
Six such serotype converting phages have been identified: 
Sfl, Sfll, SflV, SfV, and SfK, which encode the gtr gene 
cluster, and Sf6 which encodes the oac gene [10,15-18]. 
In contrast to the phage-encoded glucosylation and 
O-acetylation, the phosphoethanolamine modification at 
position 3 of Rha" and/or Rha"^ is encoded by a plasmid- 
borne opt gene [6]. 

Recently, new sites for the O-acetylation of S, flexneri 
O-antigen have been identified. These are O-acetylation at 
position 3 (major) and 4 (minor) of Rha"^ (3/4-O-acetyl- 
ation) in serotypes la, lb, 2a, 5a, Y, 6 and 6a, and at 
position 6 of GlcNAc in serotype 2a, 3a, and Y [19-22]. 
Moreover, the degree of 3/4-O-acetylation has also been 
shown to vary between the ranges 30-70% (at position 3) 
and 15-30% (at position 4), within the strains of one 
serotype [21]. Another recent study revealed that the 
3/4-O-acetylation modification in S, flexneri is mediated 
by the oacB gene, which was shown to be carried by a 
transposon-lil<e structure located upstream of the adrA 
gene on the chromosome [23] . In this study, we report the 
isolation and characterisation of a novel bacteriophage 
SflOl, from a wild type serotype 7a strain, and show for 
the first time that oacB gene was located on the intact 
genome of the prophage, SflOl. The complete genome se- 
quence of the SflOl phage was determined. Comparative 
genomics used for highlighting important genetic simi- 
larities with other lambdoid phages indicated that the 
SflOl phage has a mosaic genome and Sf6-like genome 
architecture. Analysis of a specific chromosomal integra- 
tion site for phage SflOl revealed that the phage integrates 
into the sbcB locus, thus identifying a new site for the in- 
tegration of the serotype converting phages of S, flexneri, 
and a new location of oacB in .S. flexneri chromosome. 
Additionally, this study also identified oacB gene in several 
serotype 7a isolates from various geographical regions, 
suggesting that the 3/4-O-acetylation modification is not 
uncommon in serotype 7a of S, flexneri. 

Methods 

Bacterial strains, bacteriophage and media 

The induction of bacteriophage SflOl from serotype 7a 
S, flexneri strain SFL1683 was performed using UV irra- 
diation protocol described by Adam et al. [8]. Bacterio- 
phage stocks were prepared by picking a single plaque. 



propagating on serotype Y strain (SFL124) [24], and pre- 
cipitating phage using polyethylene glycol, as described in 
Sambrook et al. [25]. S, flexneri strain SFL1691 (serotype 
7a) lacking the 3/4-O-acetylation on the Rha"^ was used 
as host for SflOl-OacB functional analysis. E, coli JM109 
was used for cloning experiments. 

Bacteria were grown in Luria-Bertani (LB) broth or LB 
agar supplemented with Chloramphenicol (25 (ig/ml) or 
Erythromycin (250 (ig/ml) when appropriate. NZCYM 
broth was employed for routine propagation of phage. 

DNA methods 

SflOl phage DNA was isolated from purified phage stock 
by treatment with Proteinase K as described by Sambrook 
et al. [24]. Bacterial genomic DNA was isolated using GE 
Healthcare genomic DNA isolation kit (GE Healthcare), 
according to the manufacturer s instructions. Plasmid iso- 
lation was performed using the Axyprep Plasmid Miniprep 
kit (Axygen Biosciences). Primers used in this study were 
synthesized by Sigma-Aldrich and are listed in Additional 
file 1: Table SI. PGR amplification was performed using 
the PfuUltra II Fusion HS DNA Polymerase (Stratagene) 
according to the manufacturer s directions. Purification of 
the PGR products was achieved by using the Wizard SV 
Gel and PGR Glean Up System (Promega). DNA sequen- 
cing was performed using Big Dye Terminator v3.1 Gycle 
Sequencing kit as recommended by the manufacturer and 
were run on an AB 3730 capillary sequencer at Biomedical 
Resources Facility, John Gurtin School of Medical Re- 
search, Australian National University. 

SflOl oacB was amplified using SflOl phage DNA as 
template, using primers SflOl-OacB-Fwd and Rev. The 
purified product was then cloned into vector pBG SK + 
(Stratagene) to generate plasmid pNV2073. Erythromycin 
gene, PGR amplified from plasmid pTRKH2 using Em 
primer pair was then cloned into pNV2073 to generate 
plasmid pNV2074. The recombinant plasmids were trans- 
formed by electroporation and maintained in JM109 cells. 
pNV2074 was also introduced into S, flexneri strain 
SFL1691 to produce SFL2516. 

Electron microscopy 

The purified phage was absorbed on carbon-coated cop- 
per grids, negatively stained with 2% phosphotungstic 
acid (pH 7.0) and visualized with a Hitachi H7000 trans- 
mission electron microscope. 

Host range determination 

The host range of phage SflOl was determined as de- 
scribed previously [17]. Briefly, different dilutions of 
phage stock were made in SM buffer; 5-10 \A of each 
dilution was then spotted on required bacterial lawn on 
an agar plate. Lytic activity (clear zone encompassing the 
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phage drop) was examined following overnight incuba- 
tion at 37°C. 

Sequencing of phage genomic DNA 

The purified SflOl phage DNA was sequenced using the 
Ion Torrent PGM 314 chip (Life Technologies) at the 
Australian Genome Resource Facility, University of 
Queensland. The reads generated were assembled into 
contigs using CLC Genomics Workbench (Ver 5.5.1, 
CLC Bio). To close the gaps between the contigs and 
re-sequence the regions of low-quality, the desired seg- 
ments were PGR amplified using phage DNA as tem- 
plate and the purified PGR products were sequenced as 
described above. 

Analysis of sequence 

Open reading frames were determined using GLG Main 
Workbench (Ver 5.5.1, GLG Bio) and NGBI ORF finder 
program. Genes within orfs were predicted based on 
homologies to the known genes found by BlastN and 
BlastP searches and the presence of Shine-Dalgarno 
ribosome binding sites. The Rho-independent termina- 
tors were identified using ARLOND terminator finding 
program [26] and tRNAscan-SE search server was used 
to scan for tRNA [27]. 

The protein level alignments were performed using Glus- 
talW [28] and BioEdit Sequence Alignment Editor [29]. 
The accession numbers for the phages used for compara- 
tive genomics were: GUS-3 (GP000711), Sf6 (NG_005344), 
HK544 (NG_019767), mEP043 c-1 (NG_019706), SfX 
(NG_017328), mEp213 (NG_019720), mEp234 (NG_019 
715), P22 (NG_002371), epsilon34 (NG_011976), ImelO 
(NG_019501), HK446 (NG_019714), Phage 2851 (FM 
180578). 

Preparation of 0-polysaccharide and NMR spectroscopy 

Lipopolysaccharide of strains SFL1683, SFL1691 and 
SFL2516 were isolated by the phenol water extraction of 
dried bacterial cells [30]. Lipid-free polysaccharide (PS) 
was prepared by treatment of the LPS with dilute acetic 
acid followed by purification using gel permeation chro- 
matography [31]. 

NMR spectra of the LPS in D2O solution were 
recorded at 80°G on a Bruker Avance II 500 MHz 
spectrometer. ID and 2D NMR experiments [32] of the 
PS were recorded in D2O solution on Bruker Avance 
500 MHz and Bruker Avance III 700 MHz spectrome- 
ters, equipped with 5 mm TGI Z-Gradient GryoProbes, 
at 24°G for strain SFL1683 and at 20°G for strain 
SFL2516. Data processing was performed using vendor- 
supplied software. Ghemical shifts are reported in ppm 
using internal sodium 3-trimethylsilyl-(2,2,3,3-^H4)-pro- 
panoate (TSP, 5h 0.00) or external 1,4-dioxane in D2O 
(6c 67.40) as references. 



Availability of supporting data 

The nucleotide sequence of SflOl phage reported in this 
article has been deposited in the GenBank database as 
accession number KJ832078. 

Results and discussion 

Isolation of SflOl 

A novel bacteriophage named SflOl was induced and 
isolated from serotype 7a, strain SFL1683. Phage SflOl 
formed clear, round plaques when plated onto the indi- 
cator strain SFL124 (serotype Y). In order to confirm 
that SFL1683 is the host strain of SflOl phage. Southern 
hybridization of Hindlll digested genomic DNA of the 
host, using DIG-labelled SflOl as a probe was per- 
formed. Results confirmed the presence of phage ge- 
nome integrated into the host chromosome (data not 
shown). 

Morphology of SflOl 

Electron microscopic examination of SflOl phage prepar- 
ation revealed that it has an isometric head (ca. 50 nm) 
and a short tail (ca. 16 nm) (Figure 1). According to the 
morphological classification of Bradley, these characteris- 
tic features are typical of phage in the G group mor- 
phology of the family Podoviridae and order Caudovirale 
[33]. Among the other serotype converting phages of 
Shigella, appearance of SflOl phage resembles phage Sf6 
and SfX [34,35]. 

Host range 

Twelve serotypes (la, lb, 2a, 2b, 3a, 3b, 4a, 4b, 5a, 7a, X, 
and Y) of S, flexneri were tested for sensitivity against 
SflOl phage. Together, SflOl was able to infect 4 




Figure 1 Electron micrograph of phage Sf101 negatively 
stained with 2% phosphotungstic acid. Scale bar, 50 nm 
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serotypes: la, lb, 3b and Y. Interestingly, we observed 
some differences in the phage ability to form plaques on 
different serotypes. SflOl plating efficiency was similar 
on serotypes Y and la; however, it was 10^ fold lower 
when plated on serotypes 3b and lb. In a number of 
short- tailed phages, adsorption is mediated by an initial 
reversible binding of the phage to bacterial LPS, followed 
by an irreversible attachment of the phage to an un- 
known secondary receptor, the presence of which has 
been found to be necessary for infection. Recently, 
OmpA and OmpC were identified as the secondary re- 
ceptors for bacteriophage Sf6, and were shown to dra- 
matically affect the rate and efficiency of bacteriophage 
Sf6 infections [36]. Bacteriophage SflOl being similar to 
Sf6 might also require presence of secondary receptors 
for infection, and the difference in the plating efficien- 
cies observed might be due to mutation in the secondary 
receptors of these strains. 

Genome properties and organisation 

The complete genome of SflOl phage was sequenced 
using 314 chips on an Ion torrent PGM, producing an 
average read length of approx. 125 bp. A total of approx. 
21,069 reads were generated which were then de-novo 
assembled into 6 major contigs using CLC Genomics 
Workbench (Ver 5.5.1, CLC Bio). The accuracy of the 
contigs was verified by mapping the reads back to the 
contigs. The regions of ambiguity and the gaps between 
the contigs were then addressed by targeted Sanger se- 
quencing to obtain a single contiguous sequence. 

The complete genome of SflOl phage consists of 
38,742 bp, with the overall G + C content of 47.4%. A 
total of 66 protein coding genes with a plausible Shine- 
Dalgarno sequence were predicted from the genome se- 
quence. Among which 41 orfs are transcribed from the 
sense strand and 25 are on the antisense strand. The 
majority of the orfs initiate translation with an ATG, 
whereas only 6 start with GTG. The sizes of the gene 
products varied from 24 a.a. to 911 a.a. 

The translated o// products were also compared with 
known protein sequences using BlastP (Additional file 2: 
Table S2). Based on the similarities, 43 out of 66 orfs 
were assigned putative functions, while the other 23 orfs 
exhibited similarity to uncharacterised proteins. The 
overall genetic organisation of SflOl phage (Figure 2a), 
consisting of "DNA packaging and structure - serotype 
conversion - regulation - recombination - replication - 
nin and the lysis region", suggested that this phage is a 
member of lambdoid family. Additionally, the layout of 
the genes was similar to that of previously sequenced 
serotype converting Shigella phages Sf6 and SfX. 

The SflOl genes were generally tightly spaced occu- 
pying 94.5% of the genome. However, there were several 
large (-200 to 600 bp) apparently non-coding regions 



within the genome (e.g. between orfs 16-17, 17-18, 
19-20, 23-24 and 37-38). Although no tRNA genes were 
identified in these or other locations by using a tRNA 
scanning program, several putative Rho-independent 
transcription terminators were identified and are shown 
in Figure 2a. 

Relationship to other phages 

Initial whole genome Blast of phage SflOl against the 
NCBI database showed that bacteriophage SflOl is re- 
lated to several lambdoid phages originating from va- 
rious hosts, like S, flexneri, E, coli and Salmonella, To 
better understand the relatedness, comparison of pro- 
teins encoded by SflOl phage to those of the 12 most re- 
lated phages was carried out. Figure 3 shows that most 
of the proteins encoded by the phage SflOl were >90% 
identical with proteins from previously characterised 
lambdoid phages. Moreover, the genome of phage SflOl 
is highly mosaic with the left half of the phage most 
homologous to E, coli phage CUS-3 and the right half 
most homologous to phage Sf6 and HK544. 

The module containing DNA packaging and structural 
proteins is the largest module of the SflOl phage, and is 
predicted to contain 15 orfs {orf 1 to 15), While the 
packaging proteins of SflOl were identical to Sf6, most 
of the proteins responsible for phage head assembly 
shared >90% identity with their counterparts in CUS-3 
(Figure 3). One of the exceptions was ORF9 in the head 
completion module. orf9 with two other genes of the 
module encode for proteins required for stabilizing the 
phage head after DNA gets loaded in it. Although, ORF9 
shared only 50% identity with its equivalent in CUS-3, it 
was found to be >90% identical to its homologue in 
phage P22. The major difference between CUS-3 and 
SflOl head morphogenesis proteins, was in the DNA in- 
jection module. SflOl ORFll, 12 and 13, forming this 
module showed only 85%, 34% and 32% identity to their 
equivalents in CUS-3. In general, the injection proteins 
are known to be more variable than the rest of the virion 
assembly proteins because these proteins are released by 
the virion during DNA injection and function in the 
host during or after the injection [37]. 

orf 15 of SflOl encodes the tail spike protein that 
adsorbs to the cell receptors (O-antigen) of the host. 
Analysis of the tail spike protein of SflOl phage revealed 
conservation in the amino terminal portion than in the 
carboxyl terminal region. The amino terminal residues 
were homologous to the cognate amino acid residues of 
CUS-3, Sf6, SfX, and epsilon34 (90, 90, 90, and 85% 
identity, respectively). However, very limited homology 
was observed between the carboxyl terminal residues. 
The reason for this could be that the amino terminal do- 
main of the P22 like tail spikes are known to bind to 
virion head and is therefore more conserved. The 
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Figure 2 Genome map and integration site of phage SflOI. (a) The SflOl genome is shown with a scale in bp. orfs are represented by 
arrows oriented in the direction of transcription. Putative functional modules are indicated above the arrows. Black knobs below the scale depict the 
Rho-independent terminators and the attachment site of the phage (ottP) is indicated with a blue vertical arrow, (b) Schematic representation of 
phage SflOl integrative recombination where attP on the phage genome recombines with the ottB within sbcB gene of 5. fiexneri chromosome to 
form the two junction's recombination sites attL and attR. Arrows PI (sbcB-dwn-att-Fwd) and P2 (Orf22-up-Rev) on the map indicate the primers used 
in the PGR. 



carboxy terminal domain, on the other hand, contains 
the O-antigen binding site and so is specific for each 
phage [37,38]. The integrase of phage SflOl (encoded by 
orfl8) shared 99% identity with the integrases of HK544, 
mEp234 and phage 2851. Moreover, further analysis of 
SflOl genome sequence has revealed a 13 bp sequence 
(5'CGTAATCGTGAAA 3') located upstream of the int 
gene, that is, identical to the attP of phage 2851 [39], 
While the integration sites and attP sequence of HK544 
and mEp234 are not known, phage 2851 has been shown 
to integrate in the sbcB locus [39]. The extremely high 
similarity of the SflOl phage attP and Int protein to 
those of phage 2851 suggested that the integration site 
of SflOl phage must be at the sbcB locus, identical to 
phage 2851. We amplified and sequenced one of the 
SflOl prophage-bacterial genomic junctions that spans 



between SflOl orf22 and sbcB gene (Figure 2b). A 3 kb 
sequence was obtained in which 13 bp attL sequence 
was identified; thus confirming the integration of SflOl 
phage into the 5' end of sbcB gene (encoding 3'-5' exo- 
nuclease). This is unlike all the other known serotype- 
converting phages of S. fiexneri which are known to 
integrate either into the agrW or thrW tRNA gene 
[10,15-18]. 

The early and regulatory regions located in the right 
half of the genome shared homology to CUS-3, Sf6, 
HK544, mEp043c-l, mEp213, mEp234 and HK446. 
However, the proteins of regulation and the replication 
module were most similar (>90% identical) to their cog- 
nates in HK446 and mEp043c-l, respectively. The nin 
region of the SflOl phage (ORF46-54) was identical to 
that of phage HK544, with ninX being the only 
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Figure 3 Comparison of SflOl with related phages. Proteins encoded by SflOl phage were compared with those of 12 other lambdoid 
phages from E. coll, S. flexnerl and Solmonello. The coloured arrows represent the level of amino acid sequence identity between SflOl protein 
and its counterpart in other phages. The red and green colours indicate >90% and >80% identity, respectively. The purple arrows on SflOl map 
represent proteins with no phage-borne homologue. 



exception showing only 43% identity. Moreover, nin E, F, 
G, H, and I of SflOl phage were also conserved in Sf6, 
mEp213, mEp234 and HK446 phages. 

The antitermination Q and a set of predicted lysis 
genes lie downstream of the nin region. The lysis mo- 
dule of phage SflOl is typical of lambdoid phages, con- 
sisting of holin, anti-holin, lysin, Rz and Rzl proteins 
(encoded by orfs 57-61), Additionally, orfSl of SflOl 
phage lies inside orfSO, in a second reading frame, as in 
other lamdoid phages [38]. Homologues of holin, anti- 
holin, lysin, and Rz were seen in all the phages shown in 
Figure 3, except for phage P22 and phage 2851. In 
contrast, equivalents of Rzl of phage SflOl were only 
present in CUS-3, Sf6, P22 and epsilon34. To the right 
of the lytic cassette, orfs 62, 64, 65 and 66 encoded pro- 
teins which shared >90% identity with their homologues 
in SfX and Sf6, whereas orf63 encoded Rha family pro- 
tein was identical to its cognate in phage CUS-3. 



Novel genes of SflOl phage 

SflOl phage proteins encoded by orfs 16, 17, 41 and 56, 
were identified as proteins with no known phage-borne 
homologues. Whilst ORE 17 and 56 shared similarity 
with uncharacterised proteins of E. coli, ORE41 had no 
homologue in E, coli or Shigella and showed limited 
homology with proteins from Saprospira grandis and 
Bipolar is maydis, BlastP results of ORE 16 revealed that 
it is an acyltransferase family protein, sharing 99% iden- 
tity with the protein encoded by gene SE0315 (named 
oacB) of S. flexneri 2a str 301. OacB has recently been 
shown to confer the 3/4-O-acetylation modification of 
Rha"^ oi S, flexneri O-antigen [23]. 

Pairwise alignments of SflOl orfl6 {SilOl-oacB) with 
the oacB at the DNA and protein levels were carried out. 
The analysis identified three base substitutions (G486 to 
A, C692 to T and G796 to A) in the SilOl-oacB. While 
the first mutation had no effect at the protein level, the 
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other two mutations resulted in two amino acid substitu- 
tions in SflOl-OacB: A231 to V and V266 to I (Additional 
file 3: Figure SI). Since, A, I and V are all neutral amino 
acids with similar structure, it was expected that these 
mutations will impose minimum effect on the function of 
OacB. This was verified by performing NMR spectroscopy 
experiments. 

Confirming the function of SflOI-OacB 

In order to confirm the function of Sf 101 -OacB, we first 
analysed the LPS of the SflOl host (SFL1683) by NMR 
spectroscopy. Results revealed NMR resonances at 
2.21 and 2.17 indicating the presence of O-acetyl groups 
[40,41]. oacB gene from SflOl phage was then cloned into 
a plasmid (pNV2074) and introduced into a serotype 7a 
strain (SFL1691), which was PGR negative for oacB gene, 
to create a recombinant strain SFL2516. The NMR 
spectrum acquired on the LPS of strain SFL2516 revealed 
resonances, inter alia, at 2.21, 2.17 and 2.15; however, 
NMR signals were absent in the spectral region for 
O-acetyl groups for the SFL1691 LPS. 

LPS of strains SFL1683 and SFL2516 were then treated 
by dilute aqueous acetic acid and purified by size exclu- 
sion chromatography to obtain the polysaccharide mate- 
rials (PS). The NMR spectra of these PS showed 
resonances, inter alia, at 2.21 (0.8 H), 2.16 (0.5 H), 2.11 
(0.9 H) and 2.07 (2.1 H) for the SFL1683 strain (Figure 4a) 
and at Sh 2.22 (0.8 H), 2.17 (0.7 H), 2.13 (0.8 H) and 2.08 
(2.2 H) for the SFL2516 strain, indicating partial O-acetyl 
substitution. The O-acetyl substitution position(s) were 
then determined by performing the ^H,^H-TOCSY experi- 
ments employing an array of mixing times of increasing 
lengths, which have proven to be particularly powerful at 
unraveling the spin-systems of the sugar residues and sub- 
sequently the O-acetyl substitution position(s) [42]. Using 
mixing times in the range 10-120 ms, the ^H,^H-TOCSY 
spectra of the PS from strain SFL2516 revealed, inter alia, 
spin systems originating from Sh 1.29 (H6 of Rha"^3Ac) to 
3.79 (H5), 3.53 (H4), 5.09 (H3) and 4.26 (H2), from 
1.16 (H6 of Rha"W) to 3.86 (H5), 4.80 (H4), 4.10 (H3) 
and 4.22 (H2), as well as from 1.26 (H6 of Rha"^) to 
3.68 (H5), 3.34 (H4), 3.87 (H3) and 4.14 (H2) (Figure 4b). 
The ^H,^H-TOCSY spectra of the PS from strain SFL1683 
were closely similar, and taken together these results were 
fully consistent with the NMR data reported by Wang 
et al. [23] for partial O-acetyl substitution at positions 03 
and 04 of Rha"^ in O-antigens from other S. flexneri sero- 
types. Thus, O-acetylation was present to about 1/4 at 03 
and to about 1/5 at 04 in the two strains investigated 
herein. 

Identification of conserved residues and motifs of OacB 

BlastP results of SflOl-OacB also revealed that this pro- 
tein shared some homology with acyltransferases from 




2.20 2.15 2.10 2.05 
^H/ppm 




1.35 1.30 1.25 1.20 
''H /ppm 

Figure 4 NMR. Selected regions of (a) NMR spectrum of the PS 
from strain SFL1683 sliowing resonances from 0- and N-acetyl groups 
and (b) ^H/H-TOCSY (T^ix 120 ms) NMR spectrum of the PS from strain 
SFL2516 showing spin systems originating from the H6 protons in the 
methyl groups of rhamnosyl residues. Annotations are given with 
respect to the O-acetylation pattern in the 0-antigen. 

V ) 
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other species. All the homologues were grouped under 
the superfamily COG 1835 (acyl_transf_3). Multiple align- 
ment of the SflOl-OacB with 20 of its homologues from 
various species was carried out for identification of the 
conserved residues and motifs. Results of alignment 
(Additional file 4: Figure S2) showed several conserved 
motifs: DGxRGxLAxxVxxHH, FFxITG(YorF)LFxxK, 
WxLxYEWxFY and YSXYLxHG (where x is any amino 
acid), present at similar positions of the proteins. The 
first 3 motifs were found in the amino terminal part of 
the protein while the fourth one was in the carboxy 
terminus. Several other conserved amino acids were 
also identified in close proximity to these motifs. The 
high levels of conservation of various amino acid motifs 
suggests that they may be involved in specific conserved 
functions. 

Distribution of oacB in S. flexneri 7a strains 

The 3/4-O-acetyl modification observed in this study was 
in a serotype 7a strain. However, serotype 7a is known to 
lack any O-acetyl modification to date [21]. Thus, several 
serotype 7a isolates from different regions (4 from 
Bangladesh, 9 from Egypt, 1 from Sweden, 31 from UK, 
and 14 from Vietnam) were screened to identify a possible 
3/4-O-acetylation modification. As the antiserum specific 
for 3/4-O-acetyl modification was not available, a PGR 
screening of 59 serotype 7a strains with oacB specific 
primers was performed. As shown in Table 1 the expected 
PGR product was amplified from 7 out of 59 strains 
(1 isolate each from Sweden, UK, and Egypt, and 4 from 
Bangladesh). The sequencing of the PGR product from 
the positive strains revealed that the oacB gene sequence 
in 2 strains (1 from Egypt and 1 from Sweden) was identi- 
cal to oacB of SflOl phage (Additional file 3: Figure SI). 
However, the oacB gene in the other 5 strains was almost 
identical to oacB from S. flexneri 2a str 301 except a base 
substitution at position 1146, which resulted in 1382 to M 
mutation in the OacB protein of all 5 strains. However, as 
I and M are neutral amino acids, this mutation is expected 
to have no effect on the function of OacB. These results 
suggest that the 3/4-O-acetyl modification in serotype 7a 
is not uncommon. Moreover, in order to determine if 
oacB gene in these 7a strains was carried by SflOl phage, 
PGRs targeting three different regions of SflOl phage 



{orfl5'Orfl6, orf30'Orf39, and orf60'Orf66) were performed 
on the above 7 oacB positive strains. Results revealed that 
1 out of the 7 tested strains carried complete SflOl phage 
suggesting this phage may have contributed in the dissem- 
ination of the oacB gene in 7a strains (Table 1). 

Integration site of SflOl phage suggests that the oacB 
gene in SflOl lysogens is located in the sbcB locus. How- 
ever, the reported site for oacB gene in serotype la, lb, 
2a, 5a and Y is upstream of the adrA gene (which en- 
codes diguanylate cyclase AdrA) [23]. Thus, the location 
of oacB gene in the 7 oacB positive serotype 7a strains 
was determined by performing PGR using primer spe- 
cific for oacB gene and adrA, Results indicate that the 
location of oacB in the SflOl lysogen was the sbcB locus, 
while in the other 6 strains oacB was located upstream 
of the adrA gene. Ghange in the chromosomal position 
of oacB from sbcB to adrA locus is likely due to disrup- 
tion of the prophage by IS elements and mobilization of 
oacB due to the combination of 2 or more IS elements 
flanking oacB, This hypothesis is supported by our ana- 
lysis of the region flanking sbcB gene in 10 previously 
sequenced S. flexneri genomes obtained from the NGBI 
database. Results revealed that the region upstream of 
sbcB gene is extremely variable containing insertion ele- 
ments in all, and SflOl like Int in 8 out of 10 strains. 
Taken together, our analysis indicate that a possible 
presence and disruption of SflOl phage by IS elements 
could have resulted in the mobilization of oacB in some 
and loss in the other strains. Additionally, Wang et al. 
have also reported that the oacB gene located upstream 
of adrA in serotypes la, lb, 2a, 5a & Y was carried by a 
transposon-like structure [23]. 

Conclusion 

This study reports the isolation and characterisation of a 
novel bacteriophage SflOl from S, flexneri serotype 7a 
strain. Gomplete genome sequence of SflOl phage was 
determined and found to contain functional oacB gene 
making it a serotype converting phage of S, flexneri. The 
comparative genomic analysis of SflOl to 12 other 
lambdoid phages revealed that bacteriophage SflOl has a 
highly mosaic genome and Sf6-like genome organisation. 
SflOl was found to integrate in the sbcB locus repre- 
senting a new genomic location of oacB gene in SflOl 



Table 1 PCR screening of oacB in serotype 7a isolates from various regions 


Serotype 7a isolates 


Number of strains 


Number of oacB 


Number of SflOl phage 


Location of 


obtained from 


screened 


PCR positive strains 


PCR positive strains 


oacB 


Bangladesh 


4 


4 


0 


Upstream of odrA 


Egypt 


9 


1 


1 


SbcB 


Sweden 


1 


1 


0 


Upstream of odrA 


UK 


31 


1 


0 


Upstream of odrA 


Vietnam 


14 


0 
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lysogen. Additionally, this study for the first time identi- 
fied oacB gene in several serotype 7a isolates from differ- 
ent regions, providing evidence of 3/4-O-acetylation 
modification in serotype 7a of S. flexneri. These findings 
will further our understanding on the serotype conver- 
ting phages of Shigella which will be useful in com- 
prehending the role these bacteriophages play in the 
survival of S, flexneri in the environment and human 
host. 
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